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METHOD AND APPARATUS FOR GENERATING, DISTRIBUTING AND 
RECONSTRUCTING DECONSTRUCTED VIDEO 

BACKGROUND OF THE INVENTION 

5 

1 . Field of the Invention 

The invention generally relates to video distribution 
systems and, more particularly, to video distribution 
systems that distribute deconstructed video through 
10 communications networks. 

2 . Description of the Background Art 

Video transmission through communications networks such 
q as the INTERNET has become ubiquitous . Typically, the video 
'-y 15 is encoded using a compression algorithm such as MPEG-2 or 
q Quicktime and transmitted as a single compressed bitstream 

^ to the user. The user's equipment comprises a plurality of 

compressed-video decoders that can decode various forms of 
«P» encoded video. However, if the user's equipment is of a 
j\ 2 0 type that has limited computing power, the decoded image 
!«* sequence may be distorted, or otherwise have poor quality. 

! !J One method that is used to match the quality of the 

l«M" 

p encoded imagery to the capabilities of the user equipment is 

.sea. 

to encode a plurality of bitstreams where each stream 
25 represents a different level of image quality. The user 
then selects a bitstream from the plurality of encoded 
bitstreams. The quality of the video encoded into the 
selected bitstream matches the processing power of the 
user's equipment, i.e., a small hand-held computing device 
3 0 capable of low resolution video presentation would select a 
low resolution bitstream. As such, the user receives and 
decodes the selected bitstream in a manner that matches the 
capabilities of their device. 

Such systems require the user to apriori understand the 
35 capabilities of their decoder and select an appropriate 

bitstream. Many times such selection is not possible or it 
is accomplished by trial and error, i.e., the user 
repeatedly selects bitstreams until an appropriate one is 
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found. Furthermore , transmitting a plurality of encoded 
bitstreams where each bitstream comprises a full 
representation of an image sequence consumes an enormous 
amount of bandwidth within the communications network and a 
large amount of storage space on a server. 

Therefore, there is a need in the art for a method and 
apparatus that deconstructs the video in a hierarchical 
manner, transmits each level of deconstructed video in the 
hierarchy, and enables user equipment or a network interface 
device to reconstruct a video stream from select levels of 
the deconstructed video that is commensurate with the 
capabilities of the user equipment. 

SUMMARY OF THE INVENTION 

The present invention provides a method and apparatus 
for deconstructing a video signal (a sequence of video 
frames) into a hierarchical structure, distributing the 
deconstructed video, and reconstructing the video signal 
such that the video signal quality of the reconstructed 
video signal matches the capabilities of the user device 
that is to decode and display the video. To begin the 
deconstruction process, the video image sequence may be 
divided into one or more constituent components, i.e., 
foreground objects are separated from the background, moving 
objects are separated from the background, faces of people 
are separated from remaining video, text or graphics are 
separated from the remaining video, and so on. Of course, 
the invention does not require the video sequence to be 
divided into its components parts such that the intact video 
image sequence can be encoded. The components or the entire 
video image sequence may be deconstructed using a single 
dimension deconstruction process or a multidimensional 
deconstruction process. The single dimension deconstruction 
process creates a hierarchy along a single characteristic of 
each component. For example, each component of the video 
may be deconstructed along a resolution characteristic by 
producing a first bitstream that is encoded using a low 
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resolution encoder, a second bitstream is created using a 
slightly higher resolution encoder, and so on. Such 
deconstruction along a single dimension (e.g., image 
resolution) is one example of the many dimensions that could 
5 be used. Other examples include color depth, frame rate, 
display type, and the like. 

In one embodiment of the invention, multidimensional 
deconstruction is accomplished by generating a partial order 
representation of the deconstructed elements for each 
10 component of the video signal. Each point in the partial 
order represents a particular valid combination of encoding 
parameters. For example, the video image sequence can be 
processed to extract background from foreground information 
s -~ within the video signal, the background information can be 

G 15 encoded using a plurality of frame rates, while the 

foreground material can be coded using another plurality of 

"~*4 

r-3 frame rates. Upon reconstruction, the user device may be 

'=£ provided a low frame rate for the background and a high 
" p frame rate for the foreground. Additionally, the foreground 

M 2 0 material can be encoded using a higher resolution than the 

background material. In fact, many levels of resolution and 
fj) frame rates can be used and then selectively combined for 

! =3 the user device. As such, the image sequence is 

. deconstructed into components which are each then 
25 deconstructed into multiple dimensions. 

Each combination of encoding parameters represented by 
a required set of points in the partial order is encoded 
into a bitstream for transmission through a network. The 
separately encoded bitstreams are transmitted from one 
30 network interface device (NID) to another through the 

network. The network may be the INTERNET, cable television 
network, satellite network, wireless network or a 
combination of networks. The bitstreams may be pointcast, 
multicast or broadcast through the network. The protocol 
35 used for transmission is any one that is supported by the 

network. The NID that receives the bitstreams, reconstructs 
the video signals into a format that can be decoded by the 
user equipment or, if the NID is an intermediate node in the 
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network, reconstructs the video signals into a format that 
can be decoded by other "downstream" NIDS . The NID at the 
edge of a network then transmits the reconstructed stream to 
the user equipment. The protocol used to propagate the 
5 video to the user equipment may be MPEG- 2 or some other 
compression protocol that is suitable for reception by the 
user equipment . 

The edge NID forms a universal translator that resides 
at the edge of a network and receives the bitstreams from 
10 the network in their deconstructed state and reconstructs 
the video signal using one or more of the bitstreams. The 
bitstreams are selected based upon the capabilities of the 
user equipment to decode the video signal. For example, if 
£ the user equipment is capable of receiving low resolution 

■*g 15 MPEG-2 signals having 24-bit color depth and a 10 fps frame 
rate, the NID selects bitstreams that have the elements of a 

"re 

□ low resolution bitstream, a 24-bit color depth and a 10 fps 

frame rate. The selected elements are combined to form an 
i5 ' MPEG-2 compliant bit stream having the required color depth 

; ss * 2 0 and frame rate. The compliant bitstream is coupled to the 
; s =% user equipment for decoding and presentation. 

Cm 

j 3 BRIEF DESCRIPTION OF THE DRAWINGS 

2 5 The teachings of the present invention can be readily 

understood by considering the following detailed description 
in conjunction with the accompanying drawings, in which: 

FIG. 1 depicts a block diagram of a system for 
generating, distributing and reconstructing deconstructed 

3 0 video; 

FIG. 2 depicts a block diagram of a network interface 
device (NID) that creates deconstructed video for 
transmission; 

FIG. 3 depicts a flow diagram, representing the 
3 5 operation of the NID of FIG. 2; 

FIG. 4 depicts an illustration of a video element 
lattice used for generation and reconstruction of a 
deconstructed video bitstream; 
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FIG. 5 depicts a block diagram of a NID that 
reconstructs the video sequence for a user device; 

FIG. 6 depicts a flow diagram representing the 
operation of the NID of FIG. 4; and 

FIG. 7 depicts a block diagram of an alternative 
embodiment of the invention. 

To facilitate understanding, identical reference 
numerals have been used, where possible, to designate 
identical elements that are common to the figures. 

DETAILED DESCRIPTION OF THE INVENTION 



FIG. 1 depicts a block diagram of a system 100 that 
! ^ generates, distributes and reconstructs deconstructed video. 

15 The system 100 comprises at least one network interface 
=? device (NID) 104 for generating deconstructed video, at 

■g least one NID 108 for reconstructing the video once 

received, and a network 106 for propagating the 
deconstructed video from NID 104 to NID 108. Also depicted. 
20 are a source 102 of video for NID 104 and a user device 110 
coupled to NID 108. These NID's generally reside at the 
"edge" of the network (i.e., edge NIDs) to couple content to 
the network or provide content to a user device or to 
another network. The video source can be any source that 
25 provides a sequence of image frames forming a video signal. 
The video signal may or may not be compressed or encoded. 
If encoded, the NID 104 decodes the encoded video signal and 
then performs a video deconstruction process. The 
deconstruction process, as described in detail below, may be 
30 based on any parameter, characteristic or property of the 
video, any characteristic or parameter of the network, or 
any characteristic or parameter of the user devices as well 
as any combination of characteristics, parameters, and 
properties of the system components. 
35 The user device 110 comprises a video decoder that 

receives a video signal, typically an encoded video signal, 
from the NID 108. The NID 108 receives the deconstructed 
video bitstreams from the network 10 6 and produces a video 
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signal that is suitable for the user device 110. The NID 
108 forms, in essence, a universal video translator that 
translates the plurality of deconstructed video bitstreams 
into a signal that is suitable for processing and display by 
5 the user device 110. 

In one embodiment of the invention as used in an 
Internet environment, the video source 102 may be a content 
server, the NID 104 may be a host server for a website that 
contains video, and the NID 108 may be a cache server in the 

10 user's "neighborhood". Such caching is well-known in the 
art as a way of reducing content delivery latency forhighly 
trafficked websites. As such, the website and its video, 
represented in a deconstructed form, are stored in the NID 
108. The deconstructed video would be selectively 

15 reconstructed depending upon the capabilities of the user 
device that requests the website pages. 

Alternatively, in another embodiment of the invention, 
NID 108 can be a gateway to another network (i.e., a 
translation node) . For example, the NID 108 may select and 

2 0 translate deconstructed video bitstreams into video signals 
having characteristics that can be transmitted through a 
wireless network. Or, the network may contain intermediate 
NIDS that reconstruct and deconstruct the bitstreams as 
needed to efficiently propagate the video to end users 

2 5 through various networks or portions of networks. A 

detailed description of a network having intermediate nodes, 
translation nodes and edge nodes is described below with 
reference to FIG. 7. 

FIG. 2 depicts a block diagram of the NID 104 and FIG. 

3 0 3 depicts a flow diagram representing the operation of the 

NID 104. To best understand the operation of the NID 104, 
the reader should simultaneously refer to both FIGs . 2 and 3. 

The NID 104 comprises an optional decoder 200, a 
processor 2 02, a memory 2 04, processor support circuits 2 06, 
35 and a network interface circuit 208. The decoder 2 00 is 

only used when the video source provides encoded video such 
as MPEG-2 compliant video or some other compressed video 
sequence. The decoder 200 converts the encoded video into a 
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sequence of video frames (also referred to as a video image 
sequence) where each frame comprises an array of pixels. 
The video image sequence is coupled to the processor 202 for 
image processing and video deconstruction. 
5 The processor 202 is a general purpose processing unit,* 

that, when executing programs that are stored in memory 2 04, 
becomes a specific purpose processor that deconstructs the 
video signal to form one or more deconstructed video 
bitstreams. Although only one processor is shown, those 
10 skilled in the art will realize that a plurality of 

processors may be used in parallel to perform the functions 
necessary to produce deconstructed video. The support 
circuits 206 comprise well-known circuits such as power 
? supplies, cache, clocks, and the like that facilitate the 

ra 

^ 15 functionality of the processor 202. The memory 204 may 
=J comprise random access memory, read only memory, flash 

«?5 memory, removable storage, hard disk drives, or any 

'5 combination thereof. The memory 204 stores a video 

deconstruction routine 210 that when executed by the 
2 0 processor 2 02 causes the processor 2 02 to become a video 

deconstruction processor. The network interface circuit 208 
uses a network transmission protocol, e.g., asynchronous 
transfer mode (ATM) , to organize and transmit the 
deconstructed video bitstreams through the network. 
25 Although the network interface device 104 is depicted 

as a general purpose computer that, when executing the 
software of the invention, performs the method of the 
invention, those skilled in the art will understand that the 
device 104 can be implemented using, for example, an 
30 application specific integrated circuit (ASIC) . 

FIG. 3 depicts a flow diagram of the video 
deconstruction process performed by the processor 202. The 
video deconstruction process performs three major 
subprocesses . First, the video image sequence is analyzed 
35 and divided into components with respect to characteristics 
in the video. Second, a base bitstream is formed that 
contains video information so that any user device connected 
to the network can receive the base bitstream. Lastly, each 
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component is encoded using one or more encoding dimensions 
to form various augmenting bitstreams that are used to 
enhance the base bitstream. As such, various combinations 
of the base bitstream with selected ones of the augmenting 
5 bitstreams will create various video sequences having 
different properties. 

The deconstruction process begins at step 3 01 and 
proceeds to step 302. At step 302, an operator (or an 
automated routine) selects the appropriate video components 
10 that are to be extracted from the video image sequence. The 
components that can be extracted include, but are not 
limited to, character recognition to differentially encode 
text versus moving video; face recognition to differentially 

s :~ encode faces versus the remaining video; motion detection to 

M 

Hj 15 differentially encode moving objects versus stationary 

;=% 

H objects; object model encoding; encoding fixed regions 

p differently than other regions; encoding infrared geometric 

shapes differently than other regions of an image; encode 
explicitly annotated regions differently than other regions; 

2 0 and chart or shape recognition to encode certain regions 
containing graphics different from other regions. The 
deconstruction method is most efficient when the extracted 
components are orthogonal . If the components are less then 
orthogonal, i.e, . information from one component is 

25 contained in another component, then the deconstructed video 
requires more bandwidth than is used by deconstructed 
orthogonal components. Once selected, step 3 03 performs 
conventional component extraction to form separate video 
sequences containing the extracted components. Of course, 

3 0 the operator may select that no components be extracted. 
At step 3 04, the routine queries whether the NID is to 

apply a single dimension deconstruction or a multiple 
dimension deconstruction to each of the components of the 
video image sequence. If a single dimension deconstruction 
35 is to be performed, the routine proceeds to step 306. 

At step 306, the user selects a dimension to use for 
deconstruction. Alternatively, the NID may automatically 
select the dimension. These dimensions may comprise image 
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resolution, image frame rate, encoding bit budget, color 
depth and the like. 

The selected dimension is divided into a hierarchical 
structure of levels where each level represents a change in 
the characteristics of the selected dimension. For example, 
if the video image sequence was divided into two components 
and if the selected dimension is the resolution of the 
image, each level of deconstruction represents a change 
(e.g., an improvement) in the resolution of the image 
stream. Each component is represented by multiple 
resolution levels and a bitstream is generated for each 
level. As such, the video image sequences that comprise 
each component are subsampled to form frames of lesser 
resolution. This subsampling process is known as forming an 
image pyramid. Once a pyramid is formed for one or more of 
the frames of video, the various levels of the pyramid are 
processed to form a bitstream containing the information in 
each level across a plurality of video frames. For example, 
five levels of an image pyramid may be created for each 
image frame, and the first level information from each 
pyramid is processed to form a first bitstream (base 
bitstream) , the second level information is formed into a 
second bitstream (a first augmentation bitstream) , and so on 
until the fifth level (the highest resolution) is formed 
into a fifth bitstream (fourth augmentation bitstream) . The 
dimension-based encoding is performed on a video image 
sequence having a plurality of extracted components, where a 
base bitstream is encoded using the remainder of the 
original video sequence after the components are extracted 
and then each extracted component is encoded using the 
dimension-based encoding. Alternatively, all the extracted 
components can each be dimension-based encoded without 
forming a single base bitstream. The bitstreams are then 
coupled to the network interface circuit 2 08 for 
transmission through the network to a user. Once all the 
video streams are processed, the routine ends at step 310. 

If multidimensional deconstruction is to be performed, 
the routine proceeds from the query at step 3 04 to step 312. 
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At step 312, the routine 210 selects a plurality of 
dimensions that are to be used in the deconstruction process 
to dimension-base encode the components of the video image 
sequence. These dimensions may include frame rate, 
5 resolution, black and white versus color, display type (VGA, 
XVGA, etc.)/ and the like. This list of dimensions is not 
considered to be exhaustive. Those skilled in the art could 
derive many other dimensions for encoding that are intended 
to fall within the scope of this invention. Any combination 
10 of these encoding dimensions could be performed by the 
routine to encode the video signal. 

At step 314, the multidimensional encoding process 
processes the video stream for the selected dimensions using 
if a hierarchical processing. This is similar to the process 

*j 15 described earlier as embodied in step 3 08, but repeated for 
Q each dimension. 

Once the bitstreams have been calculated for each 
=5 dimension, the routine calculates, at step 316, a common 

base bitstream from the plurality of bitstreams calculated 

is 

j=£ 20 in step 314. This process involves calculating the common 
j!^ components between each of the dimensional base bitstreams, 

?f\ resulting in the common base bitstream being a further lower 

Q point in a partial order representation of the augmentation 

streams for all of the chosen dimensions. The base 
25 bitstreams of each individual dimension are now considered 

to be augmentation bitstreams over the common base 

bitstream. 

This common base bitstream will constitute the base of 
a lattice (i.e., a specific partial order). Figure 4 

30 depicts an illustrative lattice 400 for a two dimensional 
encoder. In the illustration, the lattice shows a first 
dimension (axis 402) being the foreground quality and the 
second dimension (axis 404) being the background quality. 
The node 410 represents the common base bitstream, and nodes 

35 along axes 402 and 404 representing the augmentation 

bitstreams for each of the dimensions. The intersection 
points on the lattice (shown as an H x" 408A, 408B, 408C and 
so on) represent possible combinations of the base bitstream 
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with a plurality of augmentation bitstreams to form an 
" improved" video image . 

Note that in a more general embodiment of the current 
invention, there may not be a common base bitstream other 
5 than the trivial or empty bitstream. In these cases, the 
multi-dimensional deconstruction can be considered a partial 
order, and we do not transmit the (empty) common base 
bitstream. Also, there may not be a unique least 
improvement bitstream above any two given points, and again 
10 the multi-dimensional deconstruction can be considered a 

partial order (as opposed to lattice) , and a choice is made 
among the points dominating a given pair of points, and the 
corresponding base and improvement bitstreams are selected. 
Each of the bitstreams (including all base bitstreams, 
j 15 common base bitstream and augmentation streams) are then 
Q coupled to the network interface circuit 2 08 for 

transmission onto the network 106. At step 318, the 
network interface circuit 2 08 performs the required 
packetization of the bitstreams and the inclusion of a 
20 packet header and other routing information to properly 

route the packet through the network 106. The packetization 
]f\ process may comply to one of many available protocols, e.g. 

: |3 ATM, MPEG-2, and the like. The routine 210 ends at step 

320. 

25 Although the method is described as creating a 

plurality of deconstructed bitstreams that together 
represent a video sequence, the invention may combine the 
various bitstreams into a one or more bitstreams in a manner 
similar to the use of transport streams in an MPEG-compliant 
30 transport stream. 

FIG. 5 depicts a block diagram of NID 108 and FIG. 6 
depicts a flow diagram representing the operation of NID 
108. NID 108 comprises a network interface circuit 502, a 
processor 504, a memory 506, support circuits 508 and user 
35 device interface circuits 510. The network interface 

circuit 502 is coupled to the network 106 to receive the 
bitstream(s) therefrom. The bitstreams are coupled to the 
processor 504 for reconstruction into a video signal. The 
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network interface circuit receives a plurality of the 
bitstreams representing a particular program that have been 
deconstructed by the NID 104. Depending on the user device; 
the processor 504 selects the appropriate video bitstreams 
5 to be used for reconstruction of a video stream that is 
commensurate with the quality depiction capable of user 
device, i.e., the processor 504 selects a base bitstream 
plus one or more augmentation bitstreams for a particular 
node of the lattice. 

10 The processor 504 is a general purpose computer that, 

when executing particular software, becomes a specific 
purpose computer. For example, the processor 504 executes 
video reconstruction routine 512 that is stored in memory 
O 510 to cause the processor 504 to become a video 

15 reconstruction processor that reconstructs the video signal 

""-4 

Q from selected bitstreams. Alternatively, the processor 504 

.=2 may be a specific purpose computer, i.e., an application 

;5 £ specific integrated circuit (ASIC) , designed to perform 

,s N video reconstruction. Further processing of the 

\ iA 2 0 reconstructed video stream may be accomplished by the 

processor 504 executing the video processing routine 514. 
,:p Such additional processing is used to create a video 

□ bitstream that is compliant with the user device 

I: ~ requirements, e.g., creating an MPEG-2 compliant bitstream, 

25 a Quicktime bitstream, and the like. The reconstructed 
video bitstream is coupled from the processor 504 to the 
user device interface circuit 510. This interface circuit 
510 operates in conjunction with the processor 504 to 
convert the video bitstream into a signal that can be used 
3 0 by the user device. Such a stream may be baseband video, S- 
video, MPEG-2 video, Quicktime video, NTSC television 
signals, and the like. For example, if the user device 110 
were a standard television, the processor 504 generates a 
digital video signal (e.g., a sequence of video frames) and 
3 5 the circuit 510 produces a NTSC signal for reception on 
channel 3 of the user's television. 

The NID 108 may operate as a gateway to another network 
such that the NID translates the deconstructed video into 
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another form that can be distributed on a second network. 
This is particularly useful for a gateway that couples a 
wired network to a wireless network, where the wireless 
network has a narrower bandwidth than the wired network. As 
5 such, users of the wired network may access any lattice node 
while users of the wireless network may access only low 
quality nodes. 

Alternatively, in another embodiment of the invention, 
the use may request (order) various grades of video content. 
10 As such, a service provider may charge various rates that 
are commensurate with the quality of video that is ordered. 

Another alternative embodiment of the invention uses an 
eye tracking device that selects a region of a video display 
p that is of most interest, i.e., the component being viewed, 

15 and the user device requests that an augmented bitstream(s) 

"-"4 

■Q be supplied to enhance that particular region of the 

' H imagery, i.e., higher resolution can be applied to that 

Q 

ts g particular portion. In lieu of an eye tracking device that 

=4* requests selective enhancement in real time, the system may 
i ss = 2 0 store an identifier of regions of most interest to a 

M= particular viewer, then, when those regions are subsequently 

~J shown again, the enhancements are applied to those regions 

Q of interest. 

.ssst 

] '-°* The video reconstruction routine 512 of FIG. 6 begins 

2 5 at step 602 and proceeds to step 604. At step 604, the NID 

108 receives the deconstructed bitstreams from the network. 
At step 606 , the routine 600 identifies the user device 
lattice point. Generally, this lattice point is provided 
from a lookup table such that by the model number or type of 

3 0 user device, the NID can identify a lattice point that is 

commensurate with the capabilities of the user device. The 
lookup table is generally populated by users registering 
their devices with a content provider that operates the 
system of the present invention. The selected lattice point 
35 defines the characteristics for the output video (and audio) 
that must be generated by the NID. The lattice point 
identifies the intersection of all the dimensions necessary 
to produce an appropriately formatted signal, e.g., the 
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frame rate, the color depth, the resolution, and the like. 
In addition, the NID may select an available enhancement, 
i.e., an extracted component, to be used to augment a 
particular property of the video sequence. 
5 At step 608, the streams that correspond to the lattice 

point (and any augmenting bitstreams) are selected as a 
subset of all of the bitstreams that are received by the 
NID. If the bitstreams are identified by different program 
identifiers (PIDS) in an MPEG-2 or MPEG-7 transport stream, 

10 then the selection process involves identifying the PIDS of 
the bitstreams that correspond to the selected lattice node. 
In other transport protocols, the packets of a particular 
bitstream may be identified by a unique header code. In 
either instance, the NID will then select only those 

15 bitstreams having the proper information and ignore all 

others. The video information is cumulative for orthogonal 
dimensions and components. As such, to create video of 
selected characteristics, the bitstreams are decoded and the 
resultant frames from each bitstream are added to one 

2 0 another. 

At step 610, the information from the selected 
bitstreams is combined to produce a video sequence that 
complies with the lattice point characteristics. At step 
612, the routine outputs a video sequence that is 

25 commensurate with the lattice point that was identified to 
match the user device. The routine ends at step 614. 

FIG. 7 depicts a block diagram of another embodiment of 
the invention comprising a video source 702, a 
deconstruction processor 704, a first network 706, a second 

30 network 708, a plurality of NIDs 710A, 710B, 710C and 710D 
and a plurality of network appliances 712A, 712B, 712C, and 
712D. The network may also contain one or more intermediate 
NIDs 716 that perform intermediate level processing of the 
streams . 

3 5 The video source 7 02 supplies a sequence of video 

frames to the deconstruction processor 704. The processor 
704 deconstructs the video as described above to produce a 
plurality of bitstreams. The bitstreams are coupled to the 
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first network 706. NIDs 710A and 710C are located at the 
edge of network 706 to extract content from the network 706. 
These NIDs 710A and 710C extract bitstreams that represent 
the highest quality that is required by any network 
appliance that is coupled to the NID 710A or 7 10C or any 
sub-NIDs . NIDs 710A and 710C form transition nodes between 
the two networks 7 06 and 7 08. 

Sub-NIDs 71 0B and 710D receive bitstreams from NIDs 
710A and 710C. Network appliance 712D receives bitstreams 
directly from NID 710C while network appliance 712C receives 
bitst reams from sub— NID 710D. Sub— NID 71 0B forms an edge 
interface to second network 708 comprising network 
appliances 712A and 712B as well as other NIDs. In this 
manner, NID 710A extracts bitstreams representing video 
having a quality that is sufficient for the network 
appliances in second network 708. NID 710B sends bitstreams 
into the network 708 on path 714 at a quality sufficient for 
the downstream network appliances. As such, bandwidth is 
conserved at each level . 

NID 716 is an intermediate node within a network that 
performs either video deconstruction or reconstruction or 
some combination thereof. For example, the video source may 
couple directly to a broadband network and have an 
intermediate node within the network deconstruct the video 
rather than have the deconstruction on the edge of the 
network. Within the network, one or more other intermediate 
nodes may further deconstruct one or more deconstructed 
bitstreams or may reconstruct the video for deconstruction 
at another node. Within the network any number of 
intermediate nodes may perform various combinations of 
deconstruction and reconstruction of the various bitstreams. 

In another embodiment of the invention, the NID may 
form a "headend" for a content delivery network such as 
cable television. As such, multiple users may be 
transmitted the same programs in the same format broadcast 
to them via a cable network. Thus, a single output from NID 
108 may be coupled to a plurality of user devices. 
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The present invention, as described above, produces 

from a single video sequence from a plurality of bitstreams 

that are formed in a hierarchical structure such that 

subsets of the bitstreams may be combined to produce a video 

sequence that is useful to a user device. As such, the 

video sequence can be matched to the capabilities of the 

video processor of the user device. 

The invention has been described, by way of 

illustration, in various embodiments in which the selection 

of bitstreams is based at least partly on the capabilities 

of user display devices. However, as will be clear to 

practitioners of the art in light of the teachings herein, 

other embodiments are also possible and suggested, such as 

embodiments and applications in which selection of 

bitstreams is based upon a user's desire for image quality, 

perhaps differentiated across various aspects of an image or 

images (e.g., perhaps faces or other particular portions of 

images might be sent at higher quality if users so desired, 

etc . ) . 

Although various embodiments which incorporate the 
teachings of the present invention have been shown and 
described in detail herein, those skilled in the art can 
readily devise many other varied embodiments that still 
incorporate these teachings. 



